Abstract 



An algebraic notion of representational consistency is defined. A theorem relating it to 
free actions is proved. A metrizability problem of the quotient (a shape space) is discussed. 
This leads to a new algebraic variety with a metrizability result. A concrete example is 
given from stereo vision. 
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1. Introduction 

One of the famous paradoxes of Zeno is Achilles and the tortoise. This 
paradox can be resolved by the continuum hypothesis, from which is proved 
the principle of mathematical induction. Because of the principle of induc- 
tion it is difficult to construct properties which hold for a set but which no 
members can possess individually. 

This paper introduces an algebraic notion of representational consistency 
from Computer Vision using Shape Spaces [5]. An existence theorem will be 
stated. We will construct an example of one of the existence conditions. But 
the other existence condition can only be met if the original set is split. This 
is intimately connected with human vision, pattern recognition and neural 
networks. Even this was a problem of considerable difficulty since it had 
been proved there are no general case view invariants [8] as was previously 
mentioned in [21] . The problem of constructing a non-trivial example of 
representational consistency without decomposing the original set remains 
unsolved. 

The idea bears a little resemblance to the Chinese Room paradox of 
John Searle [28]. Suppose you are in a room in which you receive Chinese 
characters, consult a book containing an English version of the aforemen- 
tioned computer program and process the Chinese characters according to 
its instructions. You do not understand a word of Chinese; you simply ma- 
nipulate what, to you, are meaningless symbols, using the book and what- 
ever other equipment, like paper, pencils, erasers and filing cabinets, are 
available to you. After manipulating the symbols, you respond to a given 
Chinese question in the same language. As the computer passed the Turing 
test [26j[27]this way, it is fair, says Searle [28], to deduce that he has done 
so, too, simply by running the program manually. "Nobody just looking at 
my answers can tell that I don't speak a word of Chinese". 
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The idea we will use is this. The fingers of my hand can represent one, 
two, three, four and five; but they can also represent two, one, three, four 
and five respectively Both these representations exist simultaneously, so we 
can pair them up to give the pair one, two repeated twice, and three, four 
and five, which no longer has the structure of five elements. Prom these two 
representations which exist simultaneously, a third can be constructed also. 
But this third representation has four elements not five, it loses the structure 
of five elements. 

I call this idea representational consistency both in analogy and to distin- 
guish it from logical consistency, and because the existence theorem is based 
on the idea of representation theory. Since I first came across an article 
about the Chinese Room paradox, I felt puzzled about my own paradox for 
many years. I had been discovering a number of new theorems since I was 
an intern at Monash University with Alan Pryde. After my honours thesis 
with John Stillwell (who is well known for his books like [301 ES]) I pub- 
lished them in [2]. These theorems concern a method for representing shape 
based on an equivalence relation on polygons [2]. Now commonly known as 
a binary string descriptor [3TJ [33j [32j HSJ, [20j [22]. My doctoral supervisor 
Alan Carey noticed a problem, something nobody else has noticed even in 
the ensuing decade, and pointed me in the direction of Algebraic Geometry 
[T2l [TT1 [U QUI [HI [13]. My other doctoral supervisor Michael Brooks intro- 
duced me to computer vision p2J[T9], particularly stereo vision. I then proved 
the theorem in [U [3j [6] . Some years later, in an attempt to generalize the 
binary string descriptor, I discovered that the paradox has a mathematical 
resolution. 

2. Representational consistency 

definition 1. Let S be a set that satisfies a collection of relations, called the 
structure of S. A representation of S is a one to one mapping from S to 
some other space I, which preserves the structure of S. 

R:S^I (1) 

The mathematical resolution to the paradox is provided by constructing a 
mapping analogous to the many mappings like the Plucker map and Veronese 
map that can be found in Algebraic Geometry [T7] . 

definition 2. Let R\, R 2 be two representations of S 

U R 2 ) = {(s, {Rtis)} U {R 2 (s)}) \ s£S} (2) 
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This defines an operator on mappings whose domain is S but it is not a 
linear operator. The representations are pairwise consistent if for all pairs 
of representations Ri, R2 the map U(i?i,i?2) is one to one. 

The following is a basic existence theorem. The existence follows by 
taking the contrapositive. 

theorem 1. Let the automorphisms of S be denoted Aut(S), and suppose 
they are a group under composition, which acts freely on S . If the repre- 
sentations of S are pairwise inconsistent then Aut(S) has elements that are 
involutions. 

Proof. If the representations of S are pairwise inconsistent then for some 
01)02 U (0i, 1^2) is not one to one. 

3s,t € S t {Ms), Ms)} = {Mt), <h(t)} (3) 
Since 4>\,4>2 are representations they are one to one so 

Ms)^Mt) Ms)^Mt) (4) 

Since Aut(S) is a group we have 

4>2 1 (Ms)) = t (5) 

<p- 1 (Ms)) = t (6) 

Hence 0j~ 1 020r 1 02(s) = s 

Since Aut(S) acts freely on S, id £ Aut(S) 0f 1 020f 1 02 = id 
Since U(0i, 02) is not one to one and 0i, 02 are one to one 0i 7^ 02 

(0rV 2 ) 2 = ^ (7) 

The group Aui(S') has an involution. □ 

3. Group action and non-metrizability 

Astrom p] had shown there are distinctively shaped curves which can be 
mapped arbitrarily close to a circle by projective transformations, such as 
sillouettes of a duck and a rabbit. It turned out to be related to free actions, 
or the lack thereof. 

The set of m dimensional linear subspaces of R n is denoted G(m, n) and 
is called a Grassmanian. 
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Let G be the set of (g, d) such that g G GL(4) and d is n x n diagonal. 



The action of GL(4) x diag{GL(n)) is not free 



Example 



A 



1 1 

1 1 2 

1 1 3 
115 



(8) 



a 



d = diag(a~ L ,p~ L ,'y~ 1 ,S~ L ,l,l) 



gAd = A 

Michael Murray (who is well known for his book on differential geometry 
and statistics [24]) brought to my attention the literature on Shape Spaces. 
He also suggested a trick by which I might be able to prove the theorem 
below. 

Theorem 

M = {me M 4n | 34 x 4 minor / Vi M 4i / 0} 

Mj (GL(m) x diag{GL(n))) is not Hausdorff 

Consequently no metric exists on Mj (GL(m) x diag(GL(n))) 

4. Restricted group action and metrizability 

Let G be the set of (g, d) such that 

g G GL{A) = 542 = 543 = and 544 = 1 (9) 

d e diag{GL{n)) n > 4 (10) 
Theorem If A e M and (g, d) € G 

gAd = A => g = Id and d = Id (11) 
5 



This restricted group action acts freely. 



f f 1 l] T (12) 
M={mGM|Vim 3i /0} (13) 

\/g:M^M 3d gi{X)d = i{gX) (14) 

Theorem The quotient of i(M) by the restricted action is a manifold 
and is metrisable 
Sketch 
X € M 

Since G acts freely and X i— > is continuous and open there is a 

neighborhood of the restricted orbit of i{X) homeomorphic to a neighbor- 
hood of the orbit of X in G(3, k — 1) 



t : M -> M 4 , n 



x y z t 



5. Non-metrizability theorems 

definition 3. ^4 Zie group H is said to act properly on a manifold M if and 
only if, for all compact subsets K contained in M , Hk = {g £ H \ gKPiK ^ 
0} is relatively compact in H. 

theorem 2. If H is a Lie group acting on a manifold M, then M/H is 
Hausdorff if and only if H acts properly on M . 

Proof. Suppose H acts properly on M but M/H is not Hausdorff. Consider 
x £ M, and {gy \ g £ H} C M. If there do not exist open sets separating 
them, then x is in the closure of {gy \ g £ H}. So there must be a sequence 
giy such that x is a limit point of the sequence. The set K = {x} U {giy j 
i = 1 . . . oo} is compact. Since H acts properly {g G H | gK n X / 0} is 
also compact. However, the sequence gi has no limit, so it is not closed in 
H. This contradicts the compactness of Hk, so M/H must be Hausdorff. 

Now suppose M/H is Hausdorff. Suppose also that K is a compact subset 
of M. Since M/H is Hausdorff, if x,y £ K belong to different orbits then 
there are open sets U, V containing x, y respectively, such that UdgV = for 
all g £ H. Thus for each subset of K belonging to a single orbit, we obtain 
an open set covering it that is disjoint from all other such open sets. In other 
words we obtain an open cover of K with disjoint members. By definition 
of compactness, this has a finite subcover, which must be the same cover, 
because the members were disjoint. Hence K consists of a finite number of 
disjoint subsets Ki each belonging to a single orbit of H . Each Ki must 
be compact, otherwise the compactness of K would be contradicted. Since 
Hk = {g £ H | gK n K ^ 0} is a finite union of an intersection of compact 
sets, it is compact. It follows that H acts properly on M . □ 
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lemma 1. GL(4) x diag(GL(n)) does not act properly on (R 4 \ {0})™ 
Proof. Let K = 0(4) x {p} c M 4j „. 

Take any g G GL(4). It has a singular value decomposition u g d g vj with 
d g G diag{GL{4)). 

Take fc = [v g ], v g G 0(4), d G diag{d~ 1 , 1, . . . , 1). 

Then <7&d = u g d g vj [v g ] d = [u g ]. 

So if p is a unit eigenvector of gfcd G 0(4) x {p}. 

Hence Hk contains the subset of GL(4) fixing p, which is non-compact. □ 

Since GL(4) x diag(GL(n)) does not act properly on M, M/(GL(4) x 
diag{GL{n))) is not Hausdorff. Consequently no metric exists on Mj (GL(4) x 
diag{GL{n))). 



6. Metrizability theorems 



In what follows, GL(m) denotes the set of non-singular m x m matri- 
ces, Id denotes the identity matrix, diag() denotes a diagonal matrix whose 
diagonal entries are listed inside the brackets, and M m>n denotes the set of 
m x n matrices. 

Let M be the set of 4 x n matrices which have at least one non-zero four 
by four minor and no entry of the fourth row zero. A matrix in M that has 
all entries of its fourth row equal to one represents n > 4 points in R 3 with 
at least four points not coplanar. 

Let G be the set of (g, d) such that g G GL(4) with 541 = 542 = 543 = 
and 044 = 1, and d is n x n diagonal and non-singular where n > 4. G is a 
groups under the product operation (gi,di) o (52,^2) = (gig2,d\d2). 

lemma 2. // A G M, (g, d) G G then gAd = A implies g = Id and d = Id. 

Proof. First we show this when n = 4. 

Suppose gAd = A, where A, g, d satisfy the stated hypotheses. 
Then A is invertible since it is 4 x 4 and has a non-zero 4x4 minor. 
Multiplying gAd = A by d^A~ l on the right, we obtain g = Ad~ 1 A~ 1 , so 
g is diagonalizable. 

Moreover, <i -1 is its eigenvalue matrix, and A is an eigenvector matrix of g. 

Suppose v and A are such that gv = Xv. 

Then 

541 542 543 1 v = Xv (15) 



G is in fact a Lie group 



Since columns of A are eigenvectors of g and no fourth row entry is zero for 
A G M, w 4 ^ 0. Thus A = 1. So all eigenvalues of g are equal to 1. Since 
d^ 1 is the eigenvalue matrix of g, it is the identity matrix. Since the inverse 
of the identity matrix is still the identity matrix, d = Id. Hence also g = Id. 

If n > 4, A £ M has a non-singular 4x4 minor A^ whose fourth row has 
no zero entries, where i = *2j *3>*4)- The columns ii,*2>*3>*4 of gAd are 
gA i diag{di 1 , di 2 , dj 3 , dj 4 ). The n = 4 case implies g = Id and , . . . , dj 4 = 1. 
Hence also d = Id. □ 



Let M be the subset of M whose elements (are matrices) with no entry 
of their third row equal to zero. Let t denote the map from M to M^ n 



which maps each column 



i T 



to 



1 



t represents 



x y z t 

perspective image formation. The fact that no entry of the third row is zero 
means that no point in the configuration represented by a matrix in M lies 
in the focal plane. 

theorem 3. If g £ GL(A) with gn = 542 = 543 = and 544 = 1, X € M, 
gX € M, then there is a unique non-singular n x n diagonal matrix d such 
that 

gi{X)d 



i{gX) 



Proof. Let 



511 


512 


513 


514 


521 


522 


523 


524 


531 


532 


533 


534 











1 



be an affine transformation. 



(16) 



(17) 



X 



Xl ■ 


Xn 


Vi ■ 


■ Vn 


zi ■ 


Zn 


ti ■ 


tn 



€ M 



(18) 



so zi, . . . , z n ^ 0, and X is not contained in a plane. 



(911X1 + 51251 + S13Z1 + guh)/z 1 

(921X1 + 52251 + 523^1 + 924h)/zi 

(931X1 + 53251 + 533^1 + 9uti)/zi 
h/zi 



(911x1 + 51251 + 513 zi + guh)/z n 

(921X1 + 52251 + 523^1 + 924h)/z n 
(931X1 + 53251 + 933Z1 + 534*l)/z n 
tn/ Zn 



(19) 
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gna:i+gl2yi+gi3^i+gi4*i 

931^1 +93291 +933^1 +334*1 
921X1 +92291 +323^1 +924*1 

i(gX) = 93121+93291 +933^1 +934*1 

tl 

93ia;i +9322/1+933^1 +934*1 



9ll^n+gi29n+9l3^n+gi4*n 

931 Xl +93291 +933 21 +934*1 
92ia:n+9229'i+923^n+924*n 

931^n+9322/n+933«n+934*n 

1 

*n 

93ia:n+9329n+9332n+934*n 



Let 



d = di05(« 1 /(53ia;i+5322/l+5332 f l+534<l) I • • • , W (031 +032^+033^+034^ )) 

(21) 

Since the third rows of X and have no non-zero entries, d is non-singular. 
Multiplying the right hand side of equation [19] by d we see that it equals the 
right hand side of equation[20l Thus gi{X)d = i{gX) which proves existence. 

Suppose (g',d') G G such that g'i(X)d' = i{gX). Then g' i{X)d' = 
i(gX) = gt{X)d. Unless g' = g and d' = d, (g'g^ 1 , d'd~ l ) fixes i(X). 
t(X) = XX for some A G diag{M ntn ). Since X £ M its third row has no 
zero entries, so A is non-singular, so XX G M, ie l(X) G M. By lemma 2.2, 
g' = g and d' = d. 

Thus <7 and d are unique. □ 

Consider the following action of the group G on t(M): 

9 : G x t(M) -> t(M) 6»((g, d), = c/i(X)d (22) 

We will restrict # to those elements of G x t(M) that satisfy gi{X)d = 
i{gX) and gX G M. We will call this the restricted action of G. 

Suppose the fourth coordinates U of all scene points equal 1. Then i 
becomes invertible. 

lemma 3. Let l{X) ~ l(Y) if there is a g G GL(4) such that g^\ = g^, = 
^43 = 0, 544 = 1 and t{X) = i{gY), where X, Y G l{M). This is an 
equivalence relation on i(M). 

Proof. Suppose t(X) = i{gY), so for each i 

Y u /Y 3i = (gX) u /(gX) 3i (23) 



Y 2i /Y 3i = (gX) 2i /(gX) 3i (24) 

1/Y 3i = l/(gX) 3i (25) 

Thus Y = gX. Since the set of g G GL{4) such that 541 = g^ 2 = 043 = 
and 344 = 1 are a group, ~ is an equivalence relation. □ 
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We can therefore see that the restricted group action partitions t(M) 
into equivalence classes. Thus there is a quotient space t(M)/ ~. We will 
introduce two lemmas from [7] to prove that this quotient space is a manifold. 

definition 4. An equivalence relation on a space X is called open if whenever 
a subset A C X is open, then [A] is also open. 

lemma 4. The equivalence relation associated with the restricted group ac- 
tion is open. 

Proof. Suppose g$i = 0,532 = 1,933 = 934 = 0. Then d is of the form 
diag(u/to) for vectors v,w, where division is componentwise, and v 7^ \w 
for all non-zero A. All d 7^ diag(A, . . . , A) can be realized like this. If 931 = 
1/A, #32 = 933 = 934 = then d = diag(A, . . . , A). Hence d is surjective. Let 
D be any open subset of the set of non-singular diagonal matrices. Since d in 
theorem [3] is a continuous function of g and X when gX G M, the preimage 
of D is an open set of (g,X). Thus the restriction is an open subset of 
G x t(M), and the restricted group action is continuous. It follows from this 
that the equivalence relation is open. □ 

lemma 5. An equivalence relation on X is open if and only if the map 
tt : X — > X/ ~, Ih [X] is open. When ~ is open and X has a countable 
basis of open sets, then Xj ~ has a countable basis also. 

Proof. See lemma 2.3 in [7]. □ 

lemma 6. Let ~ be an open equivalence relation on a topological space X. 
Then R = {(x,y) \ x ~ y} is a closed subset of the space X x X if and only 
if the quotient space Xj ~ is Hausdorff. 

Proof. See lemma 2.4 in [7]. □ 
lemma 7. The set 

{(l(X), t(Y)) I 3g l(X) = i(gY) X, gY G M} 
is a closed subset of l{M) x l(M). 

Proof. Since 1? = t, we can rewrite an equation of the form i{gX) = i(Y) 
into the form i{gX) = Y, where Y%i = 1. Then gX& = X& so i{gX) = Y is 
equivalent to 

(gX) li /(gX) 3i =Y li (26) 
(gX) 2i /(gX) 3i = Y 2i (27) 
(gX) 4i /(gX) 3i = Y 4i (28) 
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which is equivalent to 



(gX) 3i Y 4i = X 4i (29) 

(gX) 2i Y 4i = Y 2i X 4i (30) 

{gX)uY At = Y U X U (31) 
Denoting the columns of X by [x^y^z^ti) and the columns of Y by 

( Sll^i + 9\2Vi + 9i3^i + guti)t'i = Ux'i (32) 

(#21^ + g22Vi + 023^ + 524^K = (33) 

(531^ + g32Vi + g^zi + guU)t'i = U (34) 

It suffices to prove that a sequence X{ , Y^ satisfying such equations con- 
verges to X, Y also satisfying these equations. There is a unique solution 
to the equations (gX)uYa = YuX&i if and only if the matrix whose rows 
are (xji-,yji-, Zit'^tit'^tiX^) has rank 4. Since this matrix is the limit of a 
sequence in M, where the limit is also in M, its rank is at least 4. Since the 
determinants of all 5 x 5 minors of the matrices in the sequence are zero, 
the determinants of all 5 x 5 minors of the limit matrix are zero. Hence 
the rank is exactly 4. Similarly, there is a unique solution to the equa- 
tions (gX)iiY/±i = YnX^i. There is always a unique solution to the equa- 
tions {gX)^iYii = Thus we have shown the existence of g such that 
i{gX) = Y , It remains to prove that such a g is non-singular. Since i is 
invertible, t(gX) = i(Y) implies gX = Y. If det(g) = then Y M, a 
contradiction. □ 

theorem 4. The quotient space l{M)/ ~ is a manifold, and is metrisable. 

Proof. Prom lemma [7] and lemma [6] we see that l(M) is a Hausdorff space. 
From lemma |4] and lemma [5] this quotient space also has a countable basis 
of open sets. Given any X £ M, there is a neighbourhood of the restricted 
orbit of i(X) homeomorphic to a neighbourhood of the orbit of X in G(3, k — 
1), because G acts freely, and X ^> [l{X)\ is continuous and open. Thus 
i(M)/ ~ is locally euclidean. Coordinate neighbourhoods on t(M)/ ~ will 
be compatible because the corresponding neighbourhoods on G(3, k — 1) are 
compatible. Hence l{M)/ ~ is a differentiable manifold. We can use the 
metric on G(3, k — 1) to give us a metric on t(M)/ ~. □ 

It is actually the case that a metric exists on any manifold. This is a 
deep theorem called the Urysohn metrisation theorem |23j . 
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7. Conclusion 



What we have shown is that the stereo vision group acts freely. This 
is almost an example of representational consistency, but not if there are 
involutions. In general a matrix involution is a diagonal matrix surrounded 
on both sides by an orthogonal matrix and its transpose, with all diagonal 
entries ±1 but not all 1. Thus if the stereo vision group G is split up (and 
neural networks do this kind of thing) it becomes an example of representa- 
tional consistency. This is exactly like a binary string descriptor. 

Roger Penrose argues [26] that by use of our consciousness we are enabled 
to perform actions that lie beyond any kind of computational activity. In 
the sequel [27] Penrose exhibits a tiling problem that is non-computable. 
He argues that we perform non-computational feats when we consciously 
understand. The example of the stereo vision group is a construction in this 
direction. 

In my lectures [6j I begin with geometric optics, because that is essentially 
all that one needs to assume. The rest is deduced from mathematics. It is 
truly remarkable that optics can reveal something like this. 

There is much more to be said on this subject. It was reached via 
Kendall's shape spaces, which offers a beautiful and vivid demonstration 
of representational consistency. The optics of the eye have been studied 
since ancient times, and are being modelled in great detail for example [16] . 
There is a stronger definition of representational consistency, which I have 
not covered either. For this automorphisms of free groups are of interest, a 
recent article [25] . And it seems possible to find representational consistency 
in biochemistry. 
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